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^■^ Abstract 

, In this paper, we derive new shape descriptors based on a direc- 

tional characterization. The main idea is to study the behavior of 
the shape neighborhood under family of transformations. We obtain 

[- — a description invariant with respect to rotation, reflection, translation 

and scaling. A well-defined metric is then proposed on the associated 

^ feature space. We show the continuity of this metric. Some results on 

shape retrieval are provided on two databases to show the accuracy of 

q the proposed shape metric. 

m 

> 1 Introduction 

•l-H 

X 

Shape characterization is becoming a crucial challenge in image analysis. 
The increasing resolution of new sensors, satellite images or scanners pro- 
vides information on the object geometry which can be interpreted by shape 
analysis. The size of data basis also requires some efficient tools for analyzing 
shapes, for example in applications such as image retrieval. This task is not 
straightforward. If the goal of shape analyzing is to recognize a 3D object, 
for instance for classification or image retrieval purposes, then the data only 
consist of a 2D projection of the object. Therefore, one dimension is "lost". 

*The work is partially supported by RFBR grant 12-01-31294 
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Besides, some noise may affects the object boundary or more precisely the 
silhouette of the object in the considered image. To address this problem, 
numerous techniques and models have been proposed. Reviews of proposed 
representations can be found in [5j E]. One class of methods consists in defin- 
ing shapes descriptors based on shape signatures histogram signatures, shape 
invariant moments, contrast, matrices or spectral features. A shape repre- 
sentation is evaluated with respect to its robustness, w.r.t. noise and/or 
intra-class variability, compacity of the description, its invariance proper- 
ties and its efficiency in terms of computation time. According to Zhang 
and Lu [6] , the different approaches can be classified into contour-based and 
region-based methods, and within each class between structural and global 
approaches. In this paper we consider shapes as binary silhouette of objects 
and concentrate on global approaches. We propose some feature vectors and 
define a metric in the feature space. Following [6], we can distinguish several 
global approaches. Simple global shape descriptors embed area, orientation, 
convexity, bending energy pHE]. Usually, these descriptors are not sufficiently 
sensitive to details to provide good scores in image retrieval. Distances be- 
tween shapes or surfaces have been proposed, such as the Hausdorf distance 
or some modification to reduce sensitivity to outlier [TO]- In this setting, 
the invariance properties can be obtained by taking the minimum distance 
over the corresponding group of transformation. A key issue is to consider a 
metric for which the minimum is computed with a low computational com- 
plexity. Shape signatures based on the boundary give a ID function as for 
example the angle function [TTJ, the curvature or the chord- length [TJ]. Using 
these signatures, a slight change in the contour may result in a big change in 
the signature. Therefore, special care is required for defining a metric on the 
signature space. To reduce the dimension of the representation, boundary or 
surface moments can be used. They usually embed good invariance proper- 
ties and are fast to compute. The geometric moments introduced in [T3] and 
extended, for example for 3D objects in [TJ], are limited in the complexity of 
shapes they can handle. Usually, lower order moments do not reflect enough 
information and higher order moments are difficult to estimate. Preferable 
alternatives are the algebric moments [15] or the fourier descriptors [16J. 
Stochastic models of the shape or the coutour have also been proposed. For 
example, autoregressive models of the boundary provide some shape descrip- 
tors [TTl ITS] . However, the problem of choosing the order of the model is 
still open. Considering too many parameters leads to an estimation issue, 
moreover, the interpretation of parameters in terms of shape properties is 
not clear. Studying the shape at different scales as motivated different work. 
The shape is then described by its inflection points after a Gaussian filter- 
ing [19] . A distance can also be derived by matching scale space images [20] . 
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Finally, the analysis can be performed using spectral transforms, such as 
Fourier J22J [21] or wavelet (2U [23] descriptors. The issues are then to set the 
number of relevant coefficients and the definition of a metric between these 
features. 

In this paper, we derive a 2D signature of shapes and propose a metric 
on the associated feature space. The first idea consists of a description of 
the boundary regularity by comparing the volume of the boundary neigh- 
borhood with the shape volume. The second idea is to study the behavior 
of this descriptor under shape transformations. We thus define a family of 
diffeomorphisms consisting in expanding the shape in one direction and con- 
tracting it in the orthogonal direction. In that way, for a given detail, there 
exists at least such a transformation enlarging its contribution to the de- 
scriptor and another one reducing it. We then derive a well defined metric 
on the feature space, and show its performance for shape discrimination on 
databases of various size. 

The paper is organized as follows. We describe the proposed shape space 
and define a metric on it in section [2] A discretization of the metric is 
described and evaluated on two different databases in section [3} Finally, 
conclusion and perspectives are drawn in section |4j 

2 A topological description of shapes 

We consider shapes as 2D silhouettes of bounded objects in the image plane: 
2.1 Shape space 

Definition 1. The pre-shape space S is the set of subsets of IR 2 satisfying 
the following conditions: 

CI: Va G S, a is compact and connected, with a strictly positive area, 

C2: Va 6 5, IR 2 \ a is connected (a has no hole). 

Let us consider a shape a G S. Define the closed ^-neighbourhood of the 
set a in the sense of the Euclidean metric as O e (a) = {x G IR 2 : e(x,a) < 
> 0, where e(-, •) is the Euclidean distance. 

On this pre-shape space, we consider the Hausdorff metric (which is well- 
defined, see, for example, [3]) for the sets in IR 2 : 

p(a, b) = inf{<5 > : a C 5 (b), b C 5 (a)}, 

where a, b C S. 
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A shape space should embed some invariance properties. Let G be the 
group of transformations of M 2 generated by rotations, translations, reflec- 
tions and scaling : G = SOf (R) x R + . To define a shape space § isometry- 
and scale-invariant, we consider: 

§ = S/G. (1) 

For a given A G §, we note r(A) = {a G S : vol(a) = l,G(a) = A}, where 
vol(-) is the area of the set. 

Therefore, on the shape space S, the Hausdorff metric becomes: 

d(A, B) = inf {p(a, b) \ a e r(A), b G r(B)} (2) 

where A, B G § (note that this metric can be compared with the Procrustes 
distance for sets consisting of finite number of points [HE]). 

Proposition 1. d(-, ■) is a well-defined metric on §. 

Proof. Let us consider A, B, C G §. It is straightforward that d(A, B) = 
d(B, A) and d(A, B) = <^ r(A) = r(B) due to the compactness of the con- 
sidered sets. Then, we only have to check the following property: d(A, C) < 
d(A, B) + d(B, C). Suppose that there exist A, B, C such that 

d(A,C) > d(A,B) + d(B,C). (3) 

Let 6 := d(A, C)-d{A,B)-d{B,C) > 0. By definition there exist a G r{A), 
bi,b 2 G r(B), c G r{C) such that 

(pi) P {a, h) < d{A, B) + 8/4, , . 

(p 2 ) P (b 2 ,c)<d(B,C) + 5/A. {V 

Then 3g G G : b\ = g{b 2 ) so that (p 3 )p(b 2 ,c) = p(bi,g(c)) and g(c) G r{C). 
Therefore, we have C\ = g(c) G r(C), p(b 2 ,c) = p(bi,Ci). 
We have: 

( Pl ) ^b.C O d ^ +5 ^(a), (p 2 + p 3 ) Cl C O^^+^ih), (5) 
and: 

(p x ) =► a C O^+^C&O, (ft + Ps) ^iC 0^+ 5 / 4 ( Cl ). (6) 
Therefore: 
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c Qd(B,C)+8/4+d(A,B)+5/4^ 
c Qd(A,B)+5/4+d(B,C)+6/4( Ci y 



(7) 




(8) 



□ 



2.2 Volume descriptor and family of transformations 

The main idea of the proposed description is to characterize the behavior of 
shapes under some transformations. These transformations aim at enlighting 
small characteristic details. We first consider the volume behavior under 
some dilation. Intuitively, this volume will increase more for sinuous shape 
boundaries than for smooth shapes. 

Let us consider a shape a e S. Idea of our shape descriptor is based on 
analyzing the fraction 



where vol(-) is the area of the set (for a C Z 2 , it would be the number of 
pixels). This parameter is well-defined as we only consider nonzero area set 
a. 

Basically, the proposed feature study the evolution of the ratio between 
the neighborhood volume and the volume of the shape after some dilation. 
It provides some information on the smoothness of the contour and on the 
size of the contour concavities. To complete the description, the next step 
consists in enlarging details in shapes for a more robust discrimination. Be- 
sides, we consider a directional analysis by defining family of transformations 
parametrized by an angle and coefficient of expansion. 

We consider a family {Ftgm} of linear transformations of M 2 in order to 
obtain more significant information about shapes. The goal of such transfor- 
mations is to emphasize "features" of the shape in specific direction. These 
transformations are defined as follows: 



where 6 e [— f , f], j3 > 1. Every is /3-times expanding in one direction 

and /3-times contracting in orthogonal, so it is a volume-preserving transfor- 
mation. 




(9) 




(3 cos 2 6 + ±sm 2 d (f3 - ±) sin# cosfl 
(13- |)sinflcos# fi sin 2 9 + \ cos 2 6 



(10) 
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Figure 1: Hand (left), after transformation (middle), and after trans- 

formation F(Q t 2) 



For every set a C R , we obtain the map 
a -> P e (a) - v °K° £ ( F W) a )\ F W)Q) „ , vr tt 

(li: 

It is clear that Pf e g) (a) is a continuous function of e, 6 and /3 and Pf* «\(a) = 



On figure [I] is shown a hand shape a for which P^ gi ^(a) = 0.23. After 
transformations, we obtain respectively P( 02 ){ a ) — 0.33 and -P(g 02 )( a ) = 0.20. 
Therefore, when expanding the shape in the finger direction, the descriptor 
increases while slightly decreasing when contracting along this direction. 
Consider i? 7 , the rotation by an angle 7. We have the following property: 

Proposition 2. V0 e [-f , §] , P( m (i? 7 a) = Pf (0+f +7)mod(7r) _ f ^(a) 

Consider R x , the reflection with respect to (horizontal line). We 

have the following property: 

Proposition 3. V0 G [-§, f ], P^(R x a) = P^ e>p) (a) 

Let us denote R the group of transformations generated by properties [2] 
and |3j The shape representation space 71 we consider for the fixed e > is 
then defined by the following mapping: 



$:§ ^ = C°([--,-]x[l,oo])AR 

A ^ P £ W) {a)/R, aer(A). (12) 

On figure |2j we can remark that the function P^ ^ (a) increases more in 
two directions, corresponding to an extention of both the calf body and its 
legs. On figure |3j the biggest slope is obtained when expanding the wings of 
the birdfight. 
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Figure 2: Calf (top left) and the associated representation P n (Fr 1; p\). The 
bottom line represents i^m for (3 = 2 and 7 = 7r/4, tt/2, —it/ 4, (the shape 
is in black and the neighborhood in grey). 
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Figure 3: Birdfight (top left) and the associated representation P n (Fr 7j p\). 
The bottom line represents for (3 = 2 and 7 = 7r/4, 7r/2, — 7r/4, (the 

shape is in black and the neighborhood in grey). 
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We then consider the following metric on 1Z: 



1(<$>(A),$(B)) :=inf 



a.b 



( 



X 



\ 



1/2 



(P( m (a) - P( m (b)) 2 e-^dP d6 



]x[l,oo] 



/ 



(13) 

where k > 0, a G r(A), b G r(B). The integral on the right-hand side 
converges due to Pf B m (a) is almost linear function of (5 for (3 big enough. 

We thus have defined a map between the shape space and the feature 
space. Two similar shapes should be associated to close points in the feature 
space. This property can be established by the continuity of mapping $ with 
respect to the metrics defined in both spaces. 

Proposition 4. $ : (S>, d(,)) — » (1Z,1(,)) is a continuous map. 

Proof. We consider shape A G §. For the a > we would like to find 5 > 
such that 

d(A,B)<5=>l($(A),$(B))<a. (14) 

Note, that Va G r(A), Pf e fi){ a ) = vol{0 £ (F {B ^a)) - 1. 
First we choose (3q such that Va G r(A), b G r(B) 



J (P( m (a)-P( m (b)) 2 e-^dpd6< 



[-§,f]x[ft),oo] 

< y (C/fe) 2 e- K/3 a73 d# < a 2 /2, 
[-f,f]x[fl, l0 o] 

where C depends only on diameter of a. 
By definition there exists a, b: 

a C 5 {b), b C 5 (a), 



(15) 



where a G r(A),6 G r(5). Equation (16) implies that V#,/3: 

F m (a) C O' 35 , F (m (b) C O^ 5 (F ( ^(o)) . 

Using (tT7l) we choose 5(0, 0) > such that Vo" < 5(0, /3), /3 < /3 



(16) 



(17) 



|Pf^)(6)-P ( ^)(a)| 



= \vol(0 £ (F (e ^b)) - vol(0%F [m a))\ 2 

< (vol(0^\F W) a)) - vol{O e {F W) a))) 2 

< a 2 /2w, (18) 
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where w = J\_k k] x \x p ] e Kl3 d(3 d6. It is easy to verify that 5(9,(3) is a 
continuous function over compact set [— |,f] x [1, /So]- Let mm 5 (9, (3) = 
5o > on this set. 
Finally, for 5 < 5 : 

l(*(A), $(P)) 2 < J (P{ m (a) - P( m {b)) 2 e-^dp d6 < 

<[ (...)e- K ^d(3d9+ [ (...)e~ K ^d/3d9 <wa 2 /2w+a 2 /2. 

(19) 

And so 

Z($(A), $(£))< a. (20) 

□ 

3 Implementation and results 

3.1 Discretization 

In practice, to compute the distance between two shapes, we have to dis- 
cretize equation [13} When analysing the surfaces representing the function 
P n (F(e >( g)) on figures [2] and [3j it is clear that the embeded information is 
redundant. Indeed, the surfaces are very smooth, so that we can employ a 
drastic discretization scheme, without loosing information. 

Before computing the proposed feature, we normalize the shapes to V- 
area in order to satisfy the scale invariance (notice that our descriptor is 
invariant with respect to isometries). Indeed, for {a, b} G S x S of the same 
shape but with different volume, we have to choose different n a and nj (see 
rt9J)) in order to obtain P na (a) = P nb (b). That is why we "normalize" every 
set to some fixed area V > by the corresponding homothety. 

We consider four directions, 9 G {— f , 0, j, 7r}, and two expanding coeffi- 
cients (3 G {3, 5}. 

3.2 MPEG-7 CE Shape-1 Part-B data set 

We first evaluate the proposed approach of the MPEG-7 CE Shape-1 Part-B 
data set (see [2"5]). composed of 7 classes, containing each 20 shapes. Al- 
though the classes are quite distinct, this data set contains important within- 
class variations (see figure 
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Figure 4: The MPEG-7 CE Shape-1 Part-B data set 
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Table 1: Retrieval scores on the MPEG-7 database 



We consider four directions, 9 G {— | , 0, f , 7r}, and two expanding coeffi- 
cients (3 G {3, 5}. The coefficient defined the metric is n = |. 

We consider the proposed metric between each pair of shapes (except 
itself of course, cause distance is 0) and report in Table [I] the percentage of 
correct n th neighbors for each class. The total correct answers correspond to 
98% for the first neighbors and 95% for the second neighbors. If we consider 
the tenth neighbors, we still obtain a total score of 85% of good retrieval. 
This shows the robustness of the proposed metric. 

3.3 Kimia database 

We now consider a database defined by Kimia. It consists in 676 shapes 
divided into 27 classes (see additional material). The global retrieval score 
is 94% for the first neighbor, 75% for the second neighbor, 69% for the third 
neighbor, 70% for the fourth neighbor and 67% for the fifth neighbor. The 
results obtained for each class are summarized in table [2] They show the 
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Table 2: The Kimia database classes and the percentage of good neighbor 
retrieval 



Figure 5: A hand shape occluded with a bar and its three first neighbors 

robustness of the proposed metric in case of a huge database. However, 
notice that we do not model the objects themselves but only consider shape 
descriptors without semantic interpretation. Therefore, the proposed metric 
is not adapted to occluded shapes. On figure [5j the three first neighbors of 
a hand, occluded by a bar, are elephants. This result is natural considering 
the number and the size of growths and their angle distribution. 

4 Conclusion 

We have proposed a new metric on a shape space based on the shape prop- 
erties after applying family of transformations. The proposed metric is well- 
defined and continuous. Retrieval results on two databases, one of them 
consisting of 676 shapes, divided in 27 classes, have proven the relevance 
of this metric. We are currently studying the injectivity of the associated 
mapping. Further studies also include the definition of a shape classification 
algorithm, based on this description. 
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